AERO: A 1.28 MOP/s/LUT Reconfigurable Inference Processor for Recurrent Neural Networks in a Resource-Limited FPGA

نویسندگان

چکیده

This study presents a resource-efficient reconfigurable inference processor for recurrent neural networks (RNN), named AERO. AERO is programmable to perform on RNN models of various types. was designed based the instruction-set architecture specializing in processing primitive vector operations that compose dataflows models. A versatile vector-processing unit (VPU) incorporated every operation and achieve high resource efficiency. Aiming at low usage, multiplication VPU carried out basis an approximation scheme. In addition, activation functions are realized with reduced tables. We developed prototype system using resource-limited field-programmable gate array, under which functionality verified extensively tasks several different The efficiency found be as 1.28 MOP/s/LUT, 1.3-times higher than previous state-of-the-art result.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Artificial Neural Networks Processor - A Hardware Implementation Using a FPGA

Several implementations of Artificial Neural Networks have been reported in scientific papers. Nevertheless, these implementations do not allow the direct use of off-line trained networks because of the much lower precision when compared with the software solutions where they are prepared or modifications in the activation function. In the present work a hardware solution called Artificial Neur...

متن کامل

Discrete recurrent neural networks for grammatical inference

Describes a novel neural architecture for learning deterministic context-free grammars, or equivalently, deterministic pushdown automata. The unique feature of the proposed network is that it forms stable state representations during learning-previous work has shown that conventional analog recurrent networks can be inherently unstable in that they cannot retain their state memory for long inpu...

متن کامل

Recurrent Neural Networks Hardware Implementation on FPGA

Recurrent Neural Networks (RNNs) have the ability to retain memory and learn data sequences. Due to the recurrent nature of RNNs, it is sometimes hard to parallelize all its computations on conventional hardware. CPUs do not currently offer large parallelism, while GPUs offer limited parallelism due to sequential components of RNN models. In this paper we present a hardware implementation of Lo...

متن کامل

OneChip: an FPGA processor with reconfigurable logic

This thesis describes a processor architecture called OneChip, which combines a fixed logic processor core and reconfigurable logic resources. Using the variable components of this architecture, the performance of speed-critical applications can be improved by customizing OneChip’s execution units or flexibility can be added to the glue logic interfaces of embedded controller type applications....

متن کامل

Survival of Extremely Preterm Neonates in a Resource-limited Setting

Background: Preterm birth is a major contributor to neonatal and under-five mortality, and births at lower gestational ages (GA) contribute more to these statistics. Most developed countries have succeeded in improving survival at extremes of GA, while most developing countries like Nigeria still lag behind. The objective of this retrospective study was to document the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2021

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics10111249